Learning to Ride a Bicycle using Iterated Phantom Induction
نویسندگان
چکیده
We build upon our work on iterated phan. ... . .. . -.. . wm IDQUC1O10D anQ Ule worK 01 naDQltZ1V anQ Alstrem OD -applying reinforcemeni learning to the challenging control task of learning to ride a bicycle. Last year phantom induction was demonstrated on a nonlinear but non-dynamica1 task. The bicycle domain is a dynamical system whose next state is a function of the current state and control inputs.. Randl,v and AlstrfIIm demonstrated this task was learnable using Sarsa(~) with shaping. Our approach integrates domain knowledge into the learning process in a flexible way. By taking advantage of what a domain theory expert can easily express, our .system learns to ride. in a few dozen examples rather than the thousands required by the system of Randl,v and AIstr,m. We also show that the two inputs, shifting the rider's center of mass and deflecting the handlebars, are not needed simultaneously for learning. Our system learns to ride using either input alone. By extending phantom induction into the space of dynamical systems we demonstrate the applicability of our learning technique to an important and difficult class of problems.
منابع مشابه
It Takes Two Neurons To Ride a Bicycle
Past attempts to get computers to ride bicycles have required an inordinate amount of learning time (1700 practice rides for a reinforcement learning approach [1], while still failing to be able to ride in a straight line), or have required an algebraic analysis of the exact equations of motion for the specific bicycle to be controlled [2, 3]. Mysteriously, humans do not need to do either of th...
متن کاملBlitzograms - Interactive Histograms
You probably know how to ride a bicycle, but would you even recognize the equations of bicycle motion? The equations are clearly not necessary for riding a bike, but are they sufficient? Suppose, that upon seeing your first bicycle, you had derived the equations of motion from scratch, starting with F=ma. Would you shout “Eureka” then jump on and start riding? No, the equations of motion are ir...
متن کاملOats ( not just for breakfast anymore )
2 3 The bicycle without a rider balances perfectly well. With a novice rider, it will fall. This is because the novice has the wrong intuitions about balancing and freezes the position of the bicycle so that its own corrective mechanism cannot work freely. Thus learning to ride does not mean learning to balance, it means learning not to unbalance, learning not to interfere.
متن کاملBicycle Access to Public Transportation: Learning from Abroad
w bile the United States has been investing in costly park-and-ride systems that have made transit increasingly dependent on the automobile, European and Japanese communities have been strengthening the potential for people to walk and bicycle to and from transit, boosting ridership at a far lower cost. In Japan and much of Europe, the fastest-growing and often predominant access mode to suburb...
متن کاملA New Method to Enhance the Low Voltage Ride Through Capability of Doubly Fed Induction Generator Using Resistive Type Superconducting Fault Current Limiter
In the event of voltage sag at the nearing of a wind park, severe currents may pass through the power electronic devices of the Doubly Fed Induction Generators (DFIGs). Hence, according to the Low Voltage Ride Through (LVRT) requirements, the wind parks are allowed to disconnect from the network after a certain time. Therefore, to avoid such actions, the LVRT requirement of wind parks is consid...
متن کامل